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A Program for Tranafoxmatlonal Syntactic Analysis 

By S« H. Fetrlck 



Introduction 



We have heen concerned vlth the problem of syntactic 
£uialysls with respect to a transformational grammar for more 



them three years now^ and during that time ve have developed 
a sequence of computer programs that Implement our Ideas on 



trcuis format lonal ahalysls. These programs for the most part 
have been vrltten In the LISP programming language and debugged 
on the APCHL Unlvac M>«460 ItlSP System* As might be expected 
the basic program has undergone e continual sequence of changes* 
These modifications stem from sever£kl causes among vhlch are 
the follGHlngi 

(1) Analysis of sentence;^} vlth respect to specific 
transformational grammars has uncovered several fundamental 
errors In our analysis procedure* This caused us to modify 
our analysis algorithm and^ of course^ to make corresponding 
changes to the liCSP Implementation of that algorithm. 

(2) - Changes have been made in the underlying class of 

f 

transfozmtlonal grammars for vhlch the analysis procedure 
Is valid* These changes vere made to permit the consideration 
of grammars that reflect current deyelopments in transfozma- 






tlonal theory* 
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(S) Actual user experience has dictated certain changes 
In input and output foimats* Several output options have 
heen added to aid In the debugging of transfoxnatlonal gramnars* 

(4) Excessive time requirements encountered In the 
anaO^ais of apaclfio sentences have suggested changes vhioh 
have Increased the efficiency of the analysis program* 

1 

Changes of the last three types are still being made* 

In spite Of this It seems appropriate at this time to document 
and make generally available a version of the program tfhlch has 
i been tested on the IBM 7044 and 7094 LISP Systems as veil as 
on the Chlvac M><466 10[SP S|ystem« It is hox>ed that this program 
vlll be of use to transformational grammar vrlters vho vlsh 
to verify that their grammars assign structural descriptions 
as Intended* It Is further hoped that the availability of 
this program may encourage people vho are interested in vorklng 
toward the realization of a natural l£ijQguage computer Input 
capability to consider the problem of mapping structural 

« 

descriptions cusslgned to a sentence by a transformational 
grammar into appropriate computer code* 

« 

In this paper ve vlll precisely define a claes of 
transformational grammars and ve vlll describe curd document 
a computer program for the analysis of sentences vlth respect 

• • 
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to grammars of this class* This p:ix)gram is at present unique 
in constituting an analysis procedure that Is ‘valid for a 
non-trlvlal class of transfoimatlonal grammars* We 'vrlll not 
present the linguistic motivation for our formalization of 
transfozmatlonal grammars^ noa? ‘will ve discuss In detail 
the analya:Ls algorithm upon vhlch the computer program la 
.based* These topics are discussed else'where [1/2 ]]• We. 

Will comment briefly on the relationship of our approach 
to that of others/ but this comparison will be deferred 
until ve have discussed our own work* 
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Claee of Grammara Considered 

/ 

ayntactlc component of a transfonnatlonal grammar/ 

* as ve define it/ consists of tvo subcomponents / a *base com- 
ponent and a tranafozmatlonal component . The 1)080 component 
specifies a set of trees or labelled brackettlngs that ve vlU 
' { call '^)ase trees” or ”deep structures ”• The transfozmatlonal 
component maps certain base trees Into other trees that ve 
vlU refer to euei "surface trees", ^ese stirface trees are/, 
of course/ also labelled bxackettlngS/ and their debracketizatlons 
constitute the sentences of the language specified hy the syntactic 
component of a transfozmatlnnal grammar. 

• t 

The base component vlU be taken to be a context free (CF) 
gramnar.^ Ve vlll not formalize a CF grammes: here but vlll 
merely describe those conventions ve hare fixed for our 
convenience} 

(1) ^e distinguished symbol Is SI. 

(2) There Is another syribol COMP/ distinct from SI/ vhlch 
Is expanded only by the rule COMP -♦ SERTB SI S£NTB« 

( 3 } The sentence boundary symbol 8ENTB Is a terminal 
symbol. Neither It not SI appei^s In the right meuiber of 
any rule except the one given above In rule 2. 



^ For a discussion of the case vhere the base component Is a 
contsict sensitive grammar see Section 3*T of reference [2]. 
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Our Input format for the hase component represents 

the CP rule “*^2 *** ^n ^2 ^n^* 

do not mahe use of a notation that eOJLows CP rules vlth 

' « 

optional or disjunctive sets of constltutents^ vhlch are 
usually denoted hy parentheses and hraehets respectively* 
Were ve to allow such notation^ however^ we would not 
^ simply produce the expanded CP gremoax that this notation 
denotes* Instead^ we would oht^^^in an eq,ulvalent granmiar 
using a generalization of the algorithms found in section 
5 reference [3]* 



Por example^ the con^lex CP rule 

fv 



(1) - (Ag)-^ a3 



- A. 



5-* ihi CAia) 




denotes 3Q0 separate CP rules* If, however. In place of this 
rule we substitute the set. of rules (li) helow, we generate 

i'. 

4*he same language » 



(11) Aj^ 
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Purthemore, structural descrlptlona of sentencea vith 

I, 

respect to a grammar 0 containing rule (i) may "be obtained 
from those with respect to a grammar G' In which rules (ii) 
are substituted for rule (i) by mereiy drawing a line throu^ 
all nodes which are nonterminal s^bols of G' but not G. 

It la possible to mechanically produce such a grammar G* that 
Is strongly eq,ulvalent In this sense to a CP grammar G that 
is written in parenthesis and brace notation. Wot only Is 
this more economical of storage space than merely expanding 
G to simple CP rules^ but the analysis of sentences with 
respect to 0^ Is^ at IcELSt for many CP analysis procedures^ 
more efficient than is analysis with respect to the expanded 

t • 

foxm of G* 
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Althoufi^h we could have programmed the mechanical derl- 

« 

vatlon of @g,ulvelent grammars y we did not do so because the 
• base component la not frequently modified during the develop- 
..mont of a transformational grasmari and it Is not difficult 
to. produce a particular equivalent grasnar by hand* IJhio was^ ^ ' 
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for example^ done for a CF grammar' related to the MITBB 
d\mlor Grammar' reducing the nuiaber of CF rules required 

I 

from over 5000 rules to Just 305 rules. 



The escbenslon of the heise component to make use of 
complex features In the lexical revrltlng t!;^les Is currently 
' i being programmed. However^ for the version of the program that 
Is presently avalhlble lexical revrSOng must«he accomplished 
by CF rules. 
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The transfozmatlonal component consists of an ordered 
set of transfoxmatlons j each of vhlch maps a tree structure 
Into emother tree structure. These tmnsfozmatlons are 
applied In order and In a cyclic fashion until they are 

• no longer applicable^ the output of a transformation at 
each stage serving as the Input to the next transformation* 
The first transformation Is applied Initially to a "deep 
structure" given by the base component^ and the ultimate 
structures produced are called surface structures. For 
details on the tr^^nsformatlhnal cycle see reference ([13 

or [23. The reader Is also referred, to those sources 
. for definitions of such terms as structural index and proper* 

analyslsj, to iwhloh m refer In the sequel. 
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We distinguish tvo types of tr^foimations, slngulazy 

• « 

and hinazy^ edthouf^ ve need not have fozmaXly made the 

distinction* Slngulazy transfomations must precede hlnazy 

transfonoations in the ordering sequence* Binary trans- 

fozmations ax^ distinguished hy the fact that they always 

• refer to two levels of emheddlng^ and they always collapse 
'I 

those two levels of emheddlng to a single emhedding level* 
(0?he nth eaibeddlng level is defined as that portion of a 
tree dominated hy precisely n occurrences of a symbol COMP 
whose immediate constituents are SENTB SI SENTB*) The 
structural index of every hlnazy transformation must contain 
two occurrences of the sentence houndazy synibol SEKTB^ 
the structural change must indicate that both occurrences are 
deleted* The nodes which satisfy the terras of the structural 
. index of some slngulazy transformation must all be contained 
in some currently deepest embedding leve3^ so of course no 
slngulazy transformation can have SEKTB as a structural 
■index term nor can any rule schema tezm X of a slngulazy 
trazisfozmation be satisfied by a sequence of trees containing 
the synjbol SERTB* 



Binary transfomnatlons are repeated at one upper embedding level 
^ until no longer applicable* Slngulazy transformations may 



; ^o be designated as BEPEATES In their application at a given 










point in a derivation. Singulaxy transformations may also 
be designated as optional (OFT) or obligatory (OBLIG) vith 
respect to their perfoxmance If applicable. In the present 
■ ' version of our analysis program ve assume all transfoxmatlons 
are optional^ reserving for a synthesis phase the elimination 
of derivations that violate the requirements of some obligatory 
. . transfoxmatlon. Inclusion of optional-obligatory considerations 
into the analysis procedure Itself is discussed in [2] but its 
programming has not been completed. 

The structural change specifies the changes to be performed 
on the tree being transformed in terms of tvo elementary trans- 
fozmatlons^ sister- ad junction and substitution (vhich includes 
deletion). Sister adjunction is an operation that adjoins tvo 
or more trees in an indicated order and substitution is an 
operation vhlch substitutes an adjoined sequence of trees 
or a single tree or a null tree for a given subtree of 
some tree B. If ah adjoined set of trees or a single tree 
is substituted for B then each substituted tree is connected 
to that node of B vhlch immediately dominated b'« If^ 
hovever, a null tree is substituted for b' then not only is h' 
deleted), but also higher structure emanating from B ' up to the • . 

ft 

•first node that dominates tvo or more nodes* 

t • • ■ • 
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The structural change Is an n-tuple vhere n is the 
number of tezms in the corresponding structural index. 

• I 

Each of its n tezms consists of either (l) the integer 
zero, denoting substitution of the null tree, ( 2 ) an integer 
J between 1 and n, denoting substitution of the Jth tree 
of the proper analysis >diich satisfies the structural index, > - ' " 

• (3) a specific morpheme, which need not necessarily be a 
tezminal symbol of the base ccmiponent, or (k) the adjunction 
of two or more of the members of (2) and (3). The ith terra 
of the structural change denotes the structure which is to be substituted 
for the ith tree of the proper cmalysis in question. 

If the Jth tezm of the structural index is a rule schema 
symbol X, then the Jth tezm of the structural change must be 
simply the number J, denoting that the structure designated 
by X is not to be modified. 



In specifying the domain of a transfozmation it is necessary 
not only to make use of the notion of a proper analysis with 
respect to a structural index but also to utilize supplementazy 
conditions. Metny different types of conditions have been 
suggested by linguists as appropriate for natural languages, 

* I , . * 

and a few types of conditions appear to be universally employed • 
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\>y transfoxnatlonal grammarians. One such type is the 

ft 

condition that tvo trees satisfying different terns of a 
structural index he identical^ and another type is the 
cc edition that a tree he dominated hy a prescrihed 
node and/or dominate other prescrihed nodes. Changes are 
currently being made to pexmlt use of all the different 
^ types of conditions that Bosenhaum uses in his core grammar [5] 
The currently avallahle version of the program, however, 
allows only the requirement of equality and the require- 
ment that a node dominate a prescrihed sequence of nodes. • 

To make clearer our fonoEdism for transfoxmatlons we 
present eui example using the LISP S-expresslon notation 
which we use as our input foznat. Consider the transfor- 
mation 

(qblig (((pbe:)) bp AUX V bp BI pass ((ALY)}) 

(1 5 (3 be EN) 4 0 6 2 8) . 

( ) PASSIVE). 

This transfoimation is an ordered ^-tuple expressed as a 
list of five elements. The first element, OBLIG in our 
example,'' denotes that this transformation is an obligatory 
singulaxy transformation. Obher choices for this first 
element axe OST for optional singulaxy transfoxmatlm. 



■iiUji-iiii 'mnvu'iniv n ■i-Ti-fi 

% • 

HEIFIBA.TED for repeated singulaxy transfonnatlon^ and BIMBQT 
for 'binaxy transfonoation. 

The second element of the transfoxmatlon is its stx*uctural 
index* Zn our exemple this is (((PPE)) HP AUK V KP BY PASS 
((ADY))). Texms consisting of a list of symbols enclosed in 
; double parentheses denote either no symbol or any one of the 
symbols thus enclosed. Hence, by (A B (((C)) D) ve mean (A B C D) 
or (A B D) and by (A B ((C D)) E) ve mean (ABC E), (A B D E), 
or (a B E)* a tern fomed by enclosing a list of symbols in 
a single^parenthests denotes the choice of exactly one symbol 
in* all of the possible vays. 

The third element is the structural change. Adjunction 
is indicated by grouping the set of trees to be adjoined, using 
a pair of parentheses. In our example the optional tree 
dominated by PBE is to be left alone (substituted for itself) 
as are the trees dominated by V and BY* The second tree 
dominated by KP is to be substituted for the first, the 
moxphemes BE and EN axe to be adjoined as ri^t sisters to 
the tree dominated by AUK, the second KP-tree is to be deleted, . 

the first KP-tree is to be substituted for the structure 

» • 

I 

dominated by PASS, and the tree dominated by ADV, if present, 
is to be left alone * 
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The fourth element is a list of pairs of trees that 
must he identical if the transfozmation is to he applicable* 
IV>r example^ Identity of the DET-trees and N*trees referenced 
in the hinaxy transfoxmation 



(BINAKT (X DET N SENTB DET ((ADJ)) N ((LCXJ)) AUX VP SENTB X) 

(1 (2 6) (3 8) 0 0 0 0 0 (THAT 9) 10 0 12) 

((2 5 ) (3 7)) SUHBL) 

is indicated hy the tern ((2 5) (3 7))» If a transfoimtion 
Inquires no conditions of Identity he met, it is still necessaxy 
to indicate this hy making the fourth element of the transfer* 
mation ( ) or liUL. 



The fifth element of a transfoxmation is simpjy the name 



of the transfoxmation* A sixth element can optionally he 
inseirted (making the transfoxmation a 6-tuple) to indicate 
that a node of a tree satisfying a given stx*uctural index texm 
must dominate a prescribed seg.uence of nodes. For example, 
if in the previous BIMARir trar ^foxmation ve add the sixth 
element ((2 (DEP))(10 (V WP))) then we indicate that the 
first D^-tree of a proper analysis must dominate the node 
DEP and the VP-tree must dominate V BP. It should he noted 



that ve'-do notxeq,uire the dominated nodes to constitute a 
conqplete proper analysis of the node that dominates them. 
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To illustrate the application of transfoxmtions to a 
deep structure given by a base component let us consider a 
transfoimational component consisting of the singulary 
tranefosnaation PASSIVE anA the BXN/VKir tranofoxmation pre- 
viously given ^ the fontier ordered first as ve require. We 
I Ckpply these rules to the base tree 



TH£ 






DET N COMP 




9ASS 



_ V BY 

.MW P/isr ^{^5g 

^ 

DET N fiU/B 

/ / • /V SMOKS. 

DEF MAN TNS 8£ XNA 

/ r 

THE p/^£S 




On the first cycle of applying rules the singulazy trans- 
fozmation is not applicable bub the binaxy transfoxmtion 
is because the proper analysis 



C 






^ t>Erx N >S6MT3, DET,N , AOX 
•tPF man hah AJJXA A^8 



I 

m 



7H£ 



VP ,56MTS, ^ 

V'^NP 

TNS BE INBr SMOKE W ^*’ 1 '^ J ** r>5'^ 

\ * PAST 5^F Eaa 
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satisfies the structural index of thh blnaxy transformation* 
POrfoxming this transformation gives the .tree 



SI 

/K 

NP AUX VP 

I Av. 

DET N COMP AUXA V NP BY PASS 

/ I NIK 

DEP MAN SI TNS LOSE DET N 

/ /K\ 1 1 

THE THAT AUX VP PAST DEP BOOK 

Ah I 

AUXA AUXB V NP THE 




THE 
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Going throufipi the transfoimtional cycle again from the 
beginning^ we see that the PASSIVE transfoxmation is applicable 
to this tree. Perfoxming it gives 



AUX 

/I I 

DET N AIKA 

/I I 

DEF BOOK 

/ I 

IHE PAST 





N COMP 

r I 

MAN SI 



THAT AUX 



VP 

As 



UP 



TMS 

/ 

PRES 



AUXA AUXB V 
BE ^ ING SMOKE DET N 

/ I 

DEP PIPE 

/ 

THE 



which is the surface structure that after verb-affix pemutatipn 
and morphophonemic tranfozmations would give the sentence^ 

**The boo}c was lost by the man that is smoking the pipe . ” 



There are a few general requirements that must be made on 
the. class of transfozmational grammars for which our program 
consitutes an analysis procedure. The first such general condition^ 
due to G. H. Ifetthews^ limits deletion by requiring that each 
deleted tree must be uniquely reconstructable from the transformed 
tree. More precisely^ the number of every term of the structural 

* . V 

index which is not a tezminal, symbol or transfozmationally introduced « 
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moxpheme must either appear somewhere in the corresponding 
structural change or else must he equated to some nu.iiber 
that does occur in the structural chEuige* 

A second general condition on a transfozmational greumnar 
G which is dictated hy recognition considerations is that thex«*! 

I he a hound on the depth of embedding as a function of the number 
of words in a sentence generated hy G. If this condition is 
not met^ a grammar can assign arbitrarily deeply embedded base 
structures to a sentence^ and, in particular, can assign an 
unbounded number of structural descriptions to a sentence. Hence, 
requirements on a grammar must be established which guarantee 
that the depth of embedding be bounded in order to make possible 
a recognition procedure whose computation is bounded as a function 
of sentence length* There are several ways of establishing such 
requirements* These are discussed in reference [2] but are not 
particularly germane to the user of the existing LISP analysis 
program, who must fix a maximum depth of embedding prior to 
analyzing a particular sentence (or set of sentences)* Ensuing 
analysis will be valid only for sentences whose maximum depth 
of embedding does not exceed this limit* 



A similar requirement on a transformational grammaj? is that 
there be a bounded number: of occurrences of the symbol COMP at 
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any emlsedding level. This requirement is alvays met in 
grammars of practical linguistic interest where recursion 
is pexmitted only through use of the symbol COMP* 



A final requirement is that the number of consecutive 
applications of a repeated transfoxmtlon which are allowed 
in a single cycle must be bounded* As with the previous 
requirement no loss of practical descriptive facility is 
caused by bounding the number of consecutive applications of some 



repeated transfoxination, for example, to the number of texminal 
symbols in the sentence being analyzed* 



This concludes our rather infozmal description of the 
class of grammars with respect to which our program finds all 
structural descriptions of a given sentence* In the following 

I 

section we give operating instructions and input-output fozmatsi 
It is hoped that the examples given will Illustrate the class of 
grammars for which ouh computer program constitutes a gene^l 
analysis procedure* 
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Operating Instructions 

• • • 

We lijnlt our consideration of computer operating 
Instructions and Input -output fomats to the IBM 7044'- 
7094 LISP 1*9 computer Implementations of our analysis 
algorithm. The versions of the program for the MIT CTSS 
^ system emd the AKIBL \MDlAO M-460 computer are essentially the 
same. 








Starblng vlth a basic LISP 1*9 system a packet of cards 
is 'read^ and overlord instructions which define and compile 
all necessazy functions are executed. Included in this packet 
are cards which set the values of certain constants Which are referenced 
as fzee variables by the various functions. 



To use the system it is first necessary to "set -up” 

(i^e. define) both the base component and the transfozmatlonal 
component '.of a grammar. A list of base component rules must 
be set as the value of the constant named GBAMMAR. For 
example^ for the gramnar 
SI HP VPl 
VPl -* VI 



VI -• APPEARS 
• HP -* IT COMP 
SI -»RF VF2 
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SI -« UP VP2 MfUf 
VP2 -• V2 UP 

" *# 

UP -♦ DET N 



' U -« 2X)GS 
• U -• CATS 



1 DET THE 



-»B3f P 



V2 -« LIKE 

com SEUTB SI SEUTB 



ve evaluate 



CSET (GEAMMAR ((Sl UP VPl)(VPl Vl)(Vl APPEARS) (UP IT COMP) 
(Sl UP VP2)(S1 UP VP2 MAN)(\/P2 V2 UP)(UP DET U)(U DOGS) 
(U CATS) (ijET THE) (MAU BY P) (V2 LIKE) ) ) 



Note that the rule COMP SEUTB Sl SEUTB is -understood 
and should not he included gus an element of the list GRAMMAR* 



In addition to the list GRAMMAR tvo other constant lists 



must he set up in specifying the base component. One is the . 
list TERMEEABLB i^ich associates ‘with each nontezminal symbol 
of the 'tose component a uniq^ue tezminal symbol. We illustrate 
the fonnat for the CF grammar ;)ust gi*ven. 
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CSE!P (TERMDABIE ((Sl S10)(VEL VP10)(V1 V10)(NP KP0)(VK VP20) 
(N NO) (DET DSTO) (MAN MANO) (V2 VgO) (COMP COMPO) (X XO) ) ) 



• ^ie particular fabricat on of tezminal eynibols (l*e^ 

• adding 0 to the corresponding nontexminal) is not essential* , 
Any LISP atoms not already used can he used. Note that the 
rule X -^XO must he included. The system req,uires this in 
^ order not to classify the rule schema, symbol X as a texniinal 
symbol of the base component. The necessity to fix the value 
of TEBMEABI^ stems from a requirement ve have of perfoxming CF 
. analysis on a sequence of trees ^ the nodes which dominate . these 
trees serving as texminal symbols for this analysis . We could 
have programmed the computation of a suitable value of TEIMABEE 



from the list GBAMMAN^ but we never got around to doing so. 



The last list which must be defined in conjunction with 
the base component is the set of rules reflecting derived 
constituent structure^ that is, phrase structure distinct from 
that implied by Just the base component but which can, nevertheless, 
be produced by the transfoxmational component operating on some 
base structure,. It is shown in [2] that there may be an unbounded 
number of such derived constituent structiire rules implied by a 
transfomational grammar.* It is further shown, however, that there 
are a bounded number of such rules that can arise from a deep structure 



whose depth of embedding is bounded. An algorithm is given for 



* Friedman [ll] shows that even if schemata are pesnnitted there can 
be no algorithm for constructing a surface graaniar for an axbi- 
traiy transfoznational grammar. 
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finding a set of phrase structure rules which includes all 

of the derived constituent structure rules that could he 

produced hy a given transfoxmational gratnmar operating on 

^ ‘ ' * o> structure whose maximum depth of embedding is less than 

*. • * ■ 

& given integer. Conditions on a transformational grammar ' 

are given in [2] which hound the maximum depth of embedding . 

, as a function of sentence length. Hence ^ for a sentence 

• of length n it is possible to find a set; of phrase structure 

cmles that includes a l l the derived constituent structure 

rules produced by a derivation of that sentence. As our 

program is presently organized^ however, it is necessary to 

decide upon a maximum depth of embedding for which a subsequent 

analysis will be valid and to provide the sufficient set of - - 

derived constituent structure (dcs) rules. A list of these 
• • • 

rules must be set to be the value of the constant AUXRULES. ‘ 

A program to mechanically compute a sufficient set of 
dcs rules for a given maximum depth of embedding has not 
been completed. Hence, at present two choices are possible. ^ • 

One can either produce by hand computation a sufficient set 
of dcs rules, using the method of [2], or else one can use 
for a particular sentence Just a set of rules that is known • * ‘ 

« 

• bo include all the dcs rules implied by some derivation (s) 
of that sentence; This latter procedure, of course, does 
not constitute a generally valid analysis procedure. It 

, • 

can fail to find certain ambiguous structural -descriptions. 
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^ ' . 

it xeqLuires at least a mental generation of the sentence 
It is^ however^ vorthwhile to use this vay of reducing the 
number of rules In AUXHUX£!S Just to insure that a fev ‘ 
sentences are Indeed generated as Intended by a relative?^ 

. C ■ economical computation. 

For the small CF grammar given above and the set 

. r 

, Of transfoxmations to follow ve will subsequently make 
use of the rules: 

« 

CSET (AUXRULBS ((COMP Sl)(Sl OMT IIP VF2) . 

(NP it) (VP2 be ED V2) (MAN BY NP) 

(Sl THAT WP VP2 im) (SI NP VPl Si) “ ' 

* -Vfc 

(si FOR TO NP VP2) (Sl FOR NP VP2) 

(VP2 TO V2 NP) (Sl FOR VP2) (Sl VP2) 

r 

(Sl VP2 MAN) (Sl FOR TO NP VP2 MAN) 

p 

(Sl FOR NP VP2 MAN) (VP2 TO BE ED V2) 

(Sl FOR VP2 MAN))) 

Nhen the three lists GRAMMAR, TERMTABLE, and ABDCRULES 
have been set, it is necessary to execute the LISP function 
of no arguments SETCFG. The command SETCFG ( ) sets up 
several tables necessary for subsequent CF grammar 
analysis. The CF analysis procedure used is the so-called* 
SBT algorithm of reference [3]. 

After setting up the base component it is possible to 
perform analysis. of sentences with respect to the CF grammar 
thus defined, (if only analysis with respect to a given CF 
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grammar Is vanted^ set its rules as the value of GBMIAH and 
set both TEBME/U3LB and AUXRULBS to NIL.) The function used 
is CFRBG. Its first argument is a sentence to he parsed^ * 
and its second argument is the nontenninal symbol which is 
to he considered the distinguished symhol of tbs grammar 
for the parsing. 

For the above grammar we have - . 



CFBBC ((IHE DOGS LXEB THE CATS) Si) a 

(((S1(HP(DET THE) (N DOGS)) (VP2(V2 HKE) (MP(DBr THE)(lf CATS)}))NIL)) 



€tnd 



CFREx; ((the dogs) NP) b 
(((np(det the)(n dogs))nil)) 



> .. 



The two values of CFREC above illustrate our notation 
for representing trees. The outermost parentheses reflect 
the fact that the output is a list of structural descriptions 
in general (although in our examples each of the two strings 
has one parsing). Each parsing is expressed as a list of two 
elements. The first element is a representation of a tree 
which is a stiTictural description of some initial substring 
of the :^rst argument of CFREC and the second element is the 
list of remaining unused terminal syinbols of the first 
argument which are not included in the substring parsed. 

The tree representation sinqply denotes the structure 
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.. by the linear notation (A^ A^ A^« • tvo trees represented 

above ayes 



HP 

A 

DET N 

/ I 

THE DOGS 



SI 



VP2 




and 






HP 

A 

bet n 

/ \ 

THE DOGS 



The transfozmtional component Is set up by the function 
SETTRANS whose single argument is a list of ordered transfozmations 
in the foimat already discussed* This function computes a list 
of so-called Inverse transfomations and fixes the value of the 
constant TRANS as this list* The inverse transfomnations and 
their role In the anaOysis procedure are discussed in [2]* The 

f onoat of a set of transfozmations for the base component cCLready 

# 

given is: 

• ' / 

* * . ’ 

* • 

\ 
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SBS'umss (( 

(OPT (X IT KP X)(l 2 (FOR TO 3) ^) ( ) C/iNTROl) 

(OPT (X IT UP X) (1 2 (that 3) ^) ( ) C/lNTR02) 

(opt (X HP (Vl V2) UP BY P X)(l 4 (BE ED 3) 0 5 2 7) ( ) PASSIVE) 

(OPT (X FOR TO HP (Vl V2 BE) X) ( 1 2 0 4 (3 5) 6) ( ) C/PJACEMENT) 

(OPT (X IT SI VP1)(1 2 0 (4 3)) ( ) EXTRAPOSITION) 

(OPT (X IT (Vl V2) FOR HP X) (l 5 3^06) ( ) PRO/rEPLAQB) 

(opt (x for to x) (i 0 3 ^) ( ) foVdeletion) 

(BIHARX (it SEHTB HP VP2 ((man)) SEHTB VPl) 

: (1 © 3 ^ 507 ) ( ) CHAHGEIEVELS))) 
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We give ‘belov the list of inverse transformations which is set 
to he the value of the constant TRANS vhen the above function is 
executed. . We will refer to these inverse transformations ' 

« 8 ubseg,uently in discussing several analysis examples. 

({(X TO X)(l FOR 2 3) OPT FOR/DBLETION(TO) ) 

t ' ' 9 ' 

(( X WP(V1 V?) FOR X)(l IT 3 4 2 5 ) OPT PR0/RBPLACE(F0R)) 

t 

j (( X IT VPl Sl)(l 2 4 3) OPT EXTRAPOSITION(IT)) 

((X FOR NP TO(Vl V2 BE)X )(1 2 4 3 5 6 ) OPT C/PLACEMENT(F0R TO)) 

((X NP BE ED(V1 V2) BY NP X)(l T 5 2 6 P 8 ) OPT PASSIVE(BE ED BY)) 
((X IT THAT NP X)(l 2 4 5 ) OPT C/lNTR02(lT THAT)) 

((X IT FOR TO NP X)(l 256 ) OPT C/IN™i(IT FOR TO)) 

(BINARY(IT NP VP 2 ((MAN))VP 1)(1 COMP 5 ) (2 3 4) CHANGE3!EVELS(IT))) 
Inverse singulary transformations are represented as five element 
•lists. The first element is the Inverse structural index. The 
second element is the inverse structural change. The third 

t 

• . 

element denotes the type of singulary transformation in 
question^ and the fourth. element is the name of that, 
transfoimaticn . The fifth element is a sequential list of 
the morphemes appearing in the inverse structural index. 

Aside from a different ordering^ inverse binary transformations 

V 

differ only in that in place of an Inverse structural change * 
there are two elements, the third and fourth, giving structural 
changes corresponding to the matrix and constituent sentences, 
respectively. We will defer further discussion of Inverse 

•x. 

transformations to the next section. 

_ To Illustrate the generative application of transformational 

■ * « 
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rules for the simple grammar given above ve giya a derivation 
of the sentence, "It appears that the dogs like the cats." 
;The underlying base structure is ‘ 



SI 

KP‘" 

IT COMP 



VPl 

\ 



VI 



A\ 



AEVEAIG 



SEmB SI SEMTB 

/\ 

NP VP2 
A /\ 

DET N V2 NP 

/ I \ tv 
THE DOGS LIKE \ X. 

DET N 
t I 
THE CATS 



■’I 

# 






ERIC: 



On the first cyc?be.no singulaxy transformations apply. There is, 
however, a proper analysis of the base tree with respect to the 
structural index of the binary transfozmatlon, 

(IT, SENTB, NP, VP2, ((MAN)), SENTB, VPl). 

Applying this transfozmation merely erases the sentence boundaxy 
symbols SEIfPB. On the second cyclic application of the 

transformational rules the transformation C/lNTR02..is applied to 
give: 

: \ ■ 




X 

APPEARS 






THAT NP VP2 

DET W V2. ^NP 
^ \ I 

THE DOGS LIKE DET 

THE 
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CATS 
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Perfoming the next transformation that applies to this 
structure^ EXTRAPOSITION^ gives: 



the teimlnal symbols of which are IT APPEARS THAT THE DOGS LIKE 
THE CATS. • 

^ Analysis of a sentence with respect to a grammar already 
set up is accomplished by means of the function RECOGNIZE. 

To analyze the sentence whose derivation appears above ^ for 
example, we evaluate RECOGNIZE ((IT APPEARS THAT THE DOGS LIKE 
THE CATS}}. Tl^ie resulting value is 

((((si(np ts (comp ((si (np (det the} 

(N D0GS}}(VP2 (V2 LIKE}(NP (DET THE} 

(N CATS} } } } }CHANGELBVELS} } (VPl 

(Vl APPEARS}}}} (C/INTRO2 EXTRAPOSITION}}} 



Note that the lists of transformations perfomed at each level 

* I 

of embedding are attached as right sisters to the sets of 
structures dominated by SI to which they apply. Note also 
that all occurrences of the symbol SBNTB are missing in this 
• output. We simply don^t add sentence boundazy symbols when 
connecting matrix and constituent sentences as reflected by 



the xiile COMP-h^SENTB SI SENTB. This cou3d, of ; course, be 



SI 




A 

DET N V2 NP 

/ » I ^ 



THE -DOGS LIKE DET^ N 



I i 



THE CATS 




iWll 



easily done^ but the fomalization’ of this paper vas not 
originally programed^ and later it vasn*t felt important 
enough to botlifir changing the output to conform vith the 
sentence boundary . formalism decided upon* Other analysis 
examples for the sample grammar are 
RECOGNIZE ((THE DOGS LIKE THE CATS)) » 

f 

. ((((S1(HP(DET THE)(N D0GS))(VP2(V2 ukb) 

(HP(DET the) (N CATS) )))) NIL) ) 

RECOGNIZE ((THE DOGS APPEARS TO LIKE THE CATS)) • 

((((si(NP it(comp((si(np (det the) ; 

(N D0GS))(VP2(V2 LIKE)(NP (DET THE) 

(N cats) ) ) ) )CHANGELEVELS) ) (VPl(Vl APPEARS) ) ) ) 

(C/INTROI C/PLACEMENT EXTRAPOSITION 

pro/replace por/deletion) ) ) 

RECOGNIZE ((IT APPEARS THAT THE CATS BE ED LIKE BY THE DOGS)) 

((((si(np it(comp((si(np(det the) 

(n dogs)) (vp2(v2 like) (hp(det the) 

(n cats))) (man by p)))changelevels)) 

(VP 1 (V 1 APPEARS)))) (C/INTR 02 PASSIVE 
• EXTRAPOSITION) )(((S1(NP IT (COMP 
■ ((S1(NP(DET THB)(N D0GS))(VP2(V2 LIKE) 

(NP(DET THB)(N CATS)))(MAN BY P») 

PASSIVE GHANGELEVELS) ) (VPl(Vl APPEARS) ) ) ) 

* * (C/IOTRO 2 

« * 
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HEXIIOGNIZE ((the a&TS APPEABS TO BE ED LIKE BY THE DOGS)) 

((((si(NP it(comp((si(kp(det the) 

« * 

. * (N D0GS))(VP2(V2 LIKE)(NP(DET THE) 

' • . (n cats))) (man by p)))changelevels)) 

(VPl(Vl APPEAES))))(c/lNTR01 PASSIVE 
C/PIACEMEKT EXTRAPOSITION PRO/REPLACE 
^ FOI^DELETION))(((Sl(l^ IT(C0MP 

((S1(NP(DET THE)(N D0GS))(VP2(V2 LIKE) 

. (NP(DET THE)(N CATS)))(MAN BY P))) 

PASSIVE CHANGELEVELS))(VP1(V1 APPEARS)))) 

H[c/IHTR01 C/PLACEMENP EXTRAPOSITEON 
pro/replace POi^DELETION))) 

To simplify this sample grammar ve have omitted a verb- 
affix pezmutation transfoimation^ number agreement transfozmations^ 
and morphophonemlc transfoxmations. This simple grammar vaa 
written by Peter Rosenbaum as an exercise in learning the 
conventions required by our system. 

We observe that the last two sentences can each be produced 
by two distinct derivations which differc only with respect to 
whether the PASSIVE transfoxmation is perfoxmed on the first cycle 
or on the second. If this transfozination is made obligatory > 
only a single derivation is allowed^ of course. 

If the user wishes to follow the course of the recognition 

procedure, ah option is possible that gives extra printing. 

« 

This is indicated by evaluating CSET (SW3 T). Conversely 
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setting the value of the constant SW3 to NIL causes the extra 

printing to he . repressed. If S173 Is sot to T^ the following 

{ 

printing results for a simple example: 

necognize((the dogs like the cats)) 

CONTINUATION NO. 0 ‘ 

(the dogs LUCE THE CATS) 

for/deletion 

pro/repiace 

EXTRAPOSITION 

C/PLACEMBNT 

PASSIVE 



C/INTRO 2 

C/iNTROl 

CHANGELEVELS 



BASE 



((((S1(NP(DET THE)(N DOGS)) (VP2(V2 LIKE)(NP(DET THE)(N CATS) )))) NIL) ) 



Names of transformations are printed as the corresponding inverse 
transformations are tested for applicability. In the above simple 
example no inverse transfozmations were applicable, so after 
exhausting them the given sentence was parsed with respect to 

$ 

the base compohent CP grammar, giving the req,uired structuaral 
description. , 

A mors complicated example is the following: 
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BECOGNIZE((XF AFPE/IRS THAT THE DOGS LIKE THE CATS)) 
CONTINUATION NO. 0 

(IT AFPBABS THAT THE DOGS LIKE THE CATS) 

. F0I|/DELETI0N 

,ero/repiace 

EXTRAPOSITION 

. . ((X) IT (VP1(V1 APPEARS)) (SI THAT(NP(DET THE)(N D0GS))(VP2 
(V2 LIKE) (NP(DET THE) (N CATS))))) 

(IT THAT ^ DOGS LIKE THE CATS(VPl(Vl APPEARS))) 

• • 

. PSTEST 





1 

i 

.♦ 





1 



Passed 

•J«**1TH CONTIHUATION 

I • 

• ' ' ' 'W, ‘ • 

■ C/PLACEMENT - 
PASSIVE 
C/INTR02 
C/iNTROl 
CHANGEIEVELS 
'BASE 

* • * 

NO CONTINUATION 
CONTINUATION NO. 1 

(IT THAT THE DOGS LIKE THE CATS(VPl(Vl APPEARS))) 

C/PLACEMENT 

t 

PASSIVE 

t ^ 

C/iNTROS 

- ((X) IT THAT (NP(DET THE)(N D0GS))(X LIKE THE CATS(VPl(Vl APPEARS)))) 
(IT(NP(DBT THE)(N dogs)) LIKE THE CATS (VPl(Vl APPEARS))) 

PSTEST 
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X 





Passed 

***2m CONTINUATION 



a I 



C/iNTROl 

CKANGELEVELS 

BASE 

> 

NO CONTINUATION 

t • 

4 

CONTINUATION NO. 2 

(it(np(det the)(n dogs)) like the cats(vpi(vi appeabs))) 

C/iNTROl 

CHANGEIEVELS 

(IT(NP(DET THE)(N D0GS))(VP2(V2 L3KB) (ip»(DET THE)(N CATS))) 
EMPTY (VP1(V1 APPEARS) ) ) 

***3TH CONTINUATION 









( IT COMP(VPl(Vl APPEABS) ) ) 

PSTEST 

Passed 

((NP(DET THE)(N D0GS))(VP2(V2 LIKE) (NP(DET THE) (N CATS)))) 

PSTEST 

Passed 

CONTINUATION NO. 4 ' 



((np(det the)(n dogs) )(vp2(v2. like} (np(det the)(n cats)))) 
fob/delbtion 

pro/beplace ... 

extraposition 



c/placement 



PASSIVE 
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C/IKTB02 

C/iNTROl 

CHANGELEiVEIS 

MSE 

((si(np(det the)(n dogs))(vp2(v2 likb)(np(dbt thb)(n cats))))) 

***5TH CONTINUATION 
CONTINUATION NO. 5 

(it(comp((si(np(det the)(n dogs))(vp2(v2 likb)(np(det the) 

(n cats)))))changelevels)(vpi(vi appears))) 

BASE 

((Sl(NP IT(C0MP((S1(NP(DET THE)(N D0GS))(VP2(V2 like)(np 
(DET THE) (N CATS) ) ) ) )CHANGELEVELS) ) (VPl(Vl APPEARS) ) ) ) 

CONTINUATION NO. 3. 

(it(np(det the)(n dogs)) iikb the cats(vpi(vi appears))) 

BASE 

NO CONTINUATION 

( ( ( (S1(NP IT(C0MP( (S1(NP(DET the) (N DOGS) ) (VP2(V2 LIKE) (NP 
(DET THE) (N CATS) ) ) ) )CHANGELBVEi:.S) ) (VPl(Vl APPEARS) ) ) ) 
(C/INTRO2 EXTRAPOSITION))) 



We give a brief commentary on this print out belov. It 
is not meant to be an explanation of hov the analysis procedure 
vorks. Rather it merely illustrates the output fozmat for 
displaying the intermediate structure that is pieced together 
At each stage of the analysis In terms that should be clear 
to the reader familiar vlth reference [2]. Other readers 









' vlll understand vhat sec^uence of computations Is being made -but 
will not have any understanding of vhy this sequence constitutes 
. a complete analysis. 

The FOI^DELETION and PRO/rePIACB inverse transformations 
' are not applicable. The gir.«n sentence is analy liable into a 
sequence of trees satisfying the inverse structural index 
of the inverse EXTRAPOSITION transfoimatlon, however. This 
sequence of trees is given on the line after the name of the 
transformation EXTRAPOSITION, and on the subsequent line is 
found the transformed sequence of trees formed by the inverse 
EXTRAPOSITION transformation. The PSTEST appearing on the 

V.V 

next line refers to a necessary tes-f} which must be passed if 

V 

a sequence of trees resulting from an inverse transformation is 
to lead to a valid structural description. If such a sequence 
of trees is to be further considered, it must be a proper 
analysis of a derived structure tree at some point in the 
derivation of the sentence In question. Hence, it must be 
possible to build up structure from this sequence of trees to 
the sentence symbol SI, using both the original base rules and 
the derived constituent structure rules included in the list 
AHXRUIES. The message PASSED on the line after PSTEST indicates . 
that this necessary test was passed. 

The next line •’w^nTH CONTINUATION indicates that the 

transformed sequence of trees is being stored on a pushdown 

> 

list as continuation number one* 

The subsequent list of transformation names refer to inverse 
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tran^foxmatioxB considered for poteAtlal perfoxmnce on the 
untransfomed seq.uence (IT APPE/IBS TH/IT THE DOGS LIKE THE 

cats). When this continuation eventually falls, the 

* • • . * 

transfomed sequence of trees (continuation number one) 

f 

' is taken from the pushdovm list« 

The Inverse transformations C/PIACEMEHT and PASSIVE do not 
, , apply hut C/IINTRO 2 does apply. The test PSTEST on the 

I • 

transfoxmed sequence of trees Is passed and this sequence Is 

stored as continuation number tvo. The continuation 

resulting from not applying the inverse c/lNTR02 transfoimatlon 

fails with no further Inverse transfoimations being applicable 

and continuation two is next considered. Inverse c/lKTEOl 

• ^ 

Is not applicable, but the inverse of binaiy transformation 
CHANGEIEVEIS Is applicable. In the case of Inverse binary 
transformations the' order of' considering transfoAned and 
uhtransformed sequences is reversed. The untransformed sequence 
Is stored away as continuation three, and that pairt of the 
inverse binary transformation Is performed which gives the 
matrix sentence continuation. This sequence, 

(IT C0MP(VP1(V1 APPEARS) ) ) , 

passes the test PSTEST as does the constituent sentence sequence 

((np(det the) (n dogs)) (vp2(v2 like)(kp(det the)(n cats)))). 

The constituent sentence continuation (which is labelled 
continuation number four) is next subjected to the entire Inverse 
.transfoxmatlonal cycle. No tranafoxnations are performed, but 
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the base structure 

(Sl(HP(DET THE)(M DOQS)) (VK(V2 LIKE)(MP(DBI m)(K CATS)))) 

Is found. This structure Is then attached to the symbol 
’ COMP of the matrix sentence continuation and stored as 
• continuation number five. This continuation is 

immediately retrieved and built up to SI by means of the > - 

, base cr^.^,^nent rules, giving a complete structural description* 

I , 

Continuation three is then pursued. This continuation dies 
immedlateily because it is not possible to build the sequence 
up to SI using base component rules. Finally, the structural 
description found is printed out. 

In addition to the printing option controlled by the 
■ value of the constant SW3> there are several other constants 
vhose value gives options . Constants NOTEST and NOTESTl deteimine 
whether the test PSTEST is to be carried out after perfoimance 
of slngulary and binary transformations, respectively. A value 
of NIL indicates the tests, are to be performed and a value 
of T Indicates they are not to be performed. Setting ONECPL 
to T is appropriate only when it ii^^own prior to analyzing 
a sentence that that sentence contains no more than a single 
COMP at any level of embedding. If ONECPL is NIL no such 
simplifying assumption is made. SDSASFOUND is a constant which, 
if its value is T, signals that each structural description is 
to be printed as soon as it is found. Setting SDSASFOUND 
^to NIL indicates structural descriptions are to be printed only 
after the analysis procedure has been completed. Finally, the 
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•value of NOOFPARSINGS controls the number of structural 
descriptions with respect to a given set up context-free 
grammar which are to he computed hy the function CFBEC* The 
. self-explanato^ possible values are ONE and ALL, and only 
■ the latter is appropriate for use in analyele with reopeot 
to a transformational grammar. Setting NOOFPAESINGS to 
j ONE is useful if for some reason only a single structural 
description of a sentence with respect to a given CP grammar 
. ^ Is re(iuired. 

Decks of cards containing the version of the system we 
have described are available to any interested group. These 
are appropriate for the IBM 709^ 7044 LISP systems. . They 

consist of LISP packets which define and compile the functions 
needed, set req,uired values of all constants, set up the simple 
grammar of this paper, and analyze a few sentences. This, in 
.conjunction with the present paper, should prove svifficient 
to permit the ‘testing of transfonnational grammars written by 
individual writers. 

Discussion 

We ha-ve defined a class of transfonnational grammars and 
have described how to use a computer program which detemines * 
all of the structural descriptions of a sentence with respect 
to an arbitraxy member of this class of grammars. It is hoped 
that we have done this well enough to permit the use of our 
• syntactic analysis system by any interested investigator. 
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Our vork on syntactic analLysis contrasts in several respects 
with that of other groups. The principal difference, obviously, 
is that few efforts have been directed toward the analysis of > » 
sentences with respect to transformational grammars. Limiting 
’ our attention to analysis procedures for transformational 
grammars, we find work by Matthews ^6,7,®] Zwicky et al 9 3 

and Kuno IlP] . ^feitthews was the first to write on the trans- 

I 

t 

foxmational recognition problem. Briefly, he showed how for 
every structural description of each sentence of a language 
defined by some transfoimational grammar, it is possible to 
set up a unique integer (specifier) that denotes a unique 
transformational deriva-tion of the sentence in question. By 
obtaining a bound f (n) on the specifier of a sentence consisting 
of n terminal symbols, he demonstrated that it is in principle 
possible to obtain all structural descriptions of a given 
sentence by applying the rules through a synthesis procedure 
no more than a bounded number of ways. He observed that actually 
following such a procedure would be prohibitively time consuming, 
and he proposed to make the procedure feasible by performing 
certain preliminary analyses on the sentence so as to preclude 
large ‘numbers of potential specifiers and bo reduce the amount, 
of exhaustive sentence synthesis required to a reasonable 
magnitude'. In his last paper he' gave details of how a system 

of specifiers could be set up for a so-called deep structure 

. ^ 
r transfoimational grammar, and he began to consider details of a 

preliminaxy analysis procedure. Tills system has not been 
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programmed to date. Our analysis procedure finds all structural 
descriptions of a given sentence along with, possibly, one or ' 
moxe incorrect structural descriptions. For a discussion of 
why a synthesis phase is necessaxy see Section 3*8 of [a] . 

Hence, this procedure mxxst be \ised as a preliminary analysis 
procedure in conjunction with an analysis->by»synthesis algorithm 
j as suggested by Matthews. Our procedure differs from his in 
that we do not eliminate blocks of sequential specifiers, 
testing other blocks by means of a synthesis component. Instead, 
we generate a set of potential specifiers that must include 
every valid specifier, and we test each of these with a subsequent 
synthesis. 

A distinctly different approach has been utilized by the 

Language Processing Techniques Sub-department of the MITBE 

- 1 * 

Corporation* Briefly, they make use of a so-called CF surface 
grammar that assigns to each sentence all of the final derived 
constituent structure trees that are specified by a given trans- 
. foimational grammar. The surface grammar may also assign unwanted, 
erroneous derived constituent struct\ire or even generate sentences 
not defined by the given transfozmational grammar. In the 
MCTHB approach inverse transformations which map trees into trees 
are applied in reverse generative order to the set of surface 
trees given by the surface grammar, and the resulting trees 

are checked to Insure that they satisfy the given base component 

• 

phrase structure rules. Finally, potential structural descriptions 
thus obtained are submitted to a synthesis component for veri- 
< ficatlon or rejection. 

.41 

« • 






'jm MM 



O 









I 



The work by Zwicky et el et the METBE Corporation has been 
directed toward achieving somewhat different goals than our own* 

- 

Whereas we insisted upon providing an analysis procedure that 
is valid for a class of grammars , the work at METRE has been 
concerned with a single grammar* (Although the scope of this 
grammar has been continually extended^ at every point its 
I associated analysis algorithm has been individually constructed 
for the existing grammar)* In addition, we have required that 
our procedure find all structural descriptions assigned to a 
sentence by a given grammar* The METRE analysis program, on the 
other hand, is not based upon underlying theoretical considerations 
which guarantee all structural descriptions of a sentence will 
be found*. There are, however, no known Instances where the 
analysis program is known to fail to produce all of the structural 
descriptions assigned to a sentence by the so-called JUNIOR Grammar 
of reference C9]. 



As might be expected, a rather high price is paid for the 
generality and theoretical validity of our program. This is 
bourne out by comparative computing times, even though there 
were tremendous differences in the computers that were used, 

in the grammars that were considered, in the programming language 

• * . *• * 
systems that were employed, and in the implementation details 

that were utilized* The total processing time of ^.11 minutes ^ 






; igtf 



r r 



. •!' ■ ■■■ • — "x M ii wcfti TTVr ii iN i nrT^'^ 



pM 









for 28 sentences cltecl by MITRE contrasts with our own processing 
time requirements which have ranged from a few seconds to as 
much as ten minutes i>er sentence on the IBM The undoubt jiiy 

greater speed of the MITRE program makes its use more feasit.^e 
for natural language computer programnlng applications . It is * 
not, however, suitable for use by a linguist who wished to 
test his own transformational grammar to ascextain that it assigns 
structural descriptions as intended. 

Kuno [lol has also considered the analysis problem for 
transfoxmational grammars. He has observed that for many trans- 
format ional grammars which have been written to describe subsets 
of English it is possible to predict contiguous portions of the 
deep structure assigned to a sentence from corresponding dis- 
tinctive local blocks of surface structure. His proposed approach 
is to make use of a standard form CP grammar which generates 
the sentences specified by a given transformational grammar and 
to develop c procedure for mapping structural descriptions assigned 
by the standard fozm grammar directly into those deep structures 
assigned to the sentence by the transfoimational giesnmar. No 
algorithm has yet been developed for 'computing the required standard 
form grammar or for devising a general procedure for mapping its 
structural descriptions into base structures assigned by a given 
transformational grammar* 
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We are still working on extending the version of the program 
described in this paper. The analysis procedure is being modi- 
fied to be valid fpr grammars which make use of complex features 
. for restricting the lexical rewriting portion of the base compo- 
. nent. We will also pexmit transfoxmations to make references to 
features. Additional facilities will also be added to permit the 
statenient of generalized structural conditions on the applicability 

I 

of transfoxmations. These modifications should pexmit consideration 
of grammars employing essentially the foxmalism used by Rosenbaum 
in his core grammar [ 5 3 • 

Several additional programs are being written to make our 
present system easier to use. The mechanical computation of 
derived constituent structure rules (AUKBULES) required for analysis* 
of sentences whose depth of embedding does not exceed a given 
constant is being programmed. A synthesis component is also 
being programmed. This is required not only to eliminate 
structures which cannot be transfoimed to produce the given 
sentence, but also is required as an independently useful tool 
in testing a transfoxmational grammr. By means of a synthesis 
component many gross errors should be detected moze economically 
than would be possible if only an analysis program were used. 

I 

Finally,, we are continually making changes designed to make 

« 

our analysis psracedure more efficient. Changes- being made to 



XV 



increase .programming efficiency Include development 'of selected 
machine -coded subroutines and Increased use. of LISP property 
lists. Changes to the underlying theoretical analysis procedure' 
vlll probably do even more to increase efficiency. Improvements 
currently being programmed include modification of the analysis 
procedure (l) to make continual use of restrictions during the> ^ 
course of an analysis vhich are associated yith the requirement 
that an obligatory transfoxmation be performed if applicable, 
and (2) to make use of selectively applicable AUXHULES at dif- 
ferent points in an analysis. It is hoped that these changes 
vlll result in a system-. vhich is significantly more useful than 
the present system for testing transfozmational grammars and 
for application to systems that allov man-machine communication 
by means of a useful subset of English. 
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